216 research outputs found

    Linkage disequilibrium in young genetically isolated Dutch population

    Get PDF
    The design and feasibility of genetic studies of complex diseases are critically dependent on the extent and distribution of linkage disequilibrium (LD) across the genome and between different populations. We have examined genomewide and region-specific LD in a young genetically isolated population identified in the Netherlands by genotyping approximately 800 Short Tandem Repeat markers distributed genomewide across 58 individuals. Several regions were an

    Use of the gamma method for self-contained gene-set analysis of SNP data

    Get PDF
    Gene-set analysis (GSA) evaluates the overall evidence of association between a phenotype and all genotyped single nucleotide polymorphisms (SNPs) in a set of genes, as opposed to testing for association between a phenotype and each SNP individually. We propose using the Gamma Method (GM) to combine gene-level P-values for assessing the significance of GS association. We performed simulations to compare the GM with several other self-contained GSA strategies, including both one-step and two-step GSA approaches, in a variety of scenarios. We denote a ‘one-step' GSA approach to be one in which all SNPs in a GS are used to derive a test of GS association without consideration of gene-level effects, and a ‘two-step' approach to be one in which all genotyped SNPs in a gene are first used to evaluate association of the phenotype with all measured variation in the gene and then the gene-level tests of association are aggregated to assess the GS association with the phenotype. The simulations suggest that, overall, two-step methods provide higher power than one-step approaches and that combining gene-level P-values using the GM with a soft truncation threshold between 0.05 and 0.20 is a powerful approach for conducting GSA, relative to the competing approaches assessed. We also applied all of the considered GSA methods to data from a pharmacogenomic study of cisplatin, and obtained evidence suggesting that the glutathione metabolism GS is associated with cisplatin drug response

    Haplotype Estimation from Fuzzy Genotypes Using Penalized Likelihood

    Get PDF
    The Composite Link Model is a generalization of the generalized linear model in which expected values of observed counts are constructed as a sum of generalized linear components. When combined with penalized likelihood, it provides a powerful and elegant way to estimate haplotype probabilities from observed genotypes. Uncertain (“fuzzy”) genotypes, like those resulting from AFLP scores, can be handled by adding an extra layer to the model. We describe the model and the estimation algorithm. We apply it to a data set of accurate human single nucleotide polymorphism (SNP) and to a data set of fuzzy tomato AFLP scores

    Multi-ethnic GWAS and meta-analysis of sleep quality identify MPP6 as a novel gene that functions in sleep center neurons

    Get PDF
    Poor sleep quality can have harmful health consequences. Although many aspects of sleep are heritable, the understandings of genetic factors involved in its physiology remain limited. Here, we performed a genome-wide association study (GWAS) using the Pittsburgh Sleep Quality Index (PSQI) in a multi-ethnic discovery cohort (n = 2868) and found two novel genome-wide loci on chromosomes 2 and 7 associated with global sleep quality. A meta-analysis in 12 independent cohorts (100 000 individuals) replicated the association on chromosome 7 between NPY and MPP6. While NPY is an important sleep gene, we tested for an independent functional role of MPP6. Expression data showed an association of this locus with both NPY and MPP6 mRNA levels in brain tissues. Moreover, knockdown of an orthologue of MPP6 in Drosophila melanogaster sleep center neurons resulted in decreased sleep duration. With convergent evidence, we describe a new locus impacting human variability in sleep quality through known NPY and novel MPP6 sleep genes.Peer reviewe

    Using an Uncertainty-Coding Matrix in Bayesian Regression Models for Haplotype-Specific Risk Detection in Family Association Studies

    Get PDF
    Haplotype association studies based on family genotype data can provide more biological information than single marker association studies. Difficulties arise, however, in the inference of haplotype phase determination and in haplotype transmission/non-transmission status. Incorporation of the uncertainty associated with haplotype inference into regression models requires special care. This task can get even more complicated when the genetic region contains a large number of haplotypes. To avoid the curse of dimensionality, we employ a clustering algorithm based on the evolutionary relationship among haplotypes and retain for regression analysis only the ancestral core haplotypes identified by it. To integrate the three sources of variation, phase ambiguity, transmission status and ancestral uncertainty, we propose an uncertainty-coding matrix which combines these three types of variability simultaneously. Next we evaluate haplotype risk with the use of such a matrix in a Bayesian conditional logistic regression model. Simulation studies and one application, a schizophrenia multiplex family study, are presented and the results are compared with those from other family based analysis tools such as FBAT. Our proposed method (Bayesian regression using uncertainty-coding matrix, BRUCM) is shown to perform better and the implementation in R is freely available

    An Open Access Database of Genome-wide Association Results

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The number of genome-wide association studies (GWAS) is growing rapidly leading to the discovery and replication of many new disease loci. Combining results from multiple GWAS datasets may potentially strengthen previous conclusions and suggest new disease loci, pathways or pleiotropic genes. However, no database or centralized resource currently exists that contains anywhere near the full scope of GWAS results.</p> <p>Methods</p> <p>We collected available results from 118 GWAS articles into a database of 56,411 significant SNP-phenotype associations and accompanying information, making this database freely available here. In doing so, we met and describe here a number of challenges to creating an open access database of GWAS results. Through preliminary analyses and characterization of available GWAS, we demonstrate the potential to gain new insights by querying a database across GWAS.</p> <p>Results</p> <p>Using a genomic bin-based density analysis to search for highly associated regions of the genome, positive control loci (e.g., MHC loci) were detected with high sensitivity. Likewise, an analysis of highly repeated SNPs across GWAS identified replicated loci (e.g., <it>APOE</it>, <it>LPL</it>). At the same time we identified novel, highly suggestive loci for a variety of traits that did not meet genome-wide significant thresholds in prior analyses, in some cases with strong support from the primary medical genetics literature (<it>SLC16A7, CSMD1, OAS1</it>), suggesting these genes merit further study. Additional adjustment for linkage disequilibrium within most regions with a high density of GWAS associations did not materially alter our findings. Having a centralized database with standardized gene annotation also allowed us to examine the representation of functional gene categories (gene ontologies) containing one or more associations among top GWAS results. Genes relating to cell adhesion functions were highly over-represented among significant associations (p < 4.6 × 10<sup>-14</sup>), a finding which was not perturbed by a sensitivity analysis.</p> <p>Conclusion</p> <p>We provide access to a full gene-annotated GWAS database which could be used for further querying, analyses or integration with other genomic information. We make a number of general observations. Of reported associated SNPs, 40% lie within the boundaries of a RefSeq gene and 68% are within 60 kb of one, indicating a bias toward gene-centricity in the findings. We found considerable heterogeneity in information available from GWAS suggesting the wider community could benefit from standardization and centralization of results reporting.</p

    Germline polymorphisms in SIPA1 are associated with metastasis and other indicators of poor prognosis in breast cancer

    Get PDF
    INTRODUCTION: There is growing evidence that heritable genetic variation modulates metastatic efficiency. Our previous work using a mouse mammary tumor model has shown that metastatic efficiency is modulated by the GTPase-activating protein encoded by Sipa1 ('signal-induced proliferation-associated gene 1'). The aim of this study was to determine whether single nucleotide polymorphisms (SNPs) within the human SIPA1 gene are associated with metastasis and other disease characteristics in breast cancer. METHOD: The study population (n = 300) consisted of randomly selected non-Hispanic Caucasian breast cancer patients identified from a larger population-based series. Genomic DNA was extracted from peripheral leukocytes. Three previously described SNPs within SIPA1 (one within the promoter [-313G>A] and two exonic [545C>T and 2760G>A]) were characterized using SNP-specific PCR. RESULTS: The variant 2760G>A and the -313G>A allele were associated with lymph node involvement (P = 0.0062 and P = 0.0083, respectively), and the variant 545C>T was associated with estrogen receptor negative tumors (P = 0.0012) and with progesterone negative tumors (P = 0.0339). Associations were identified between haplotypes defined by the three SNPs and disease progression. Haplotype 3 defined by variants -313G>A and 2760G>A was associated with positive lymph node involvement (P = 0.0051), and haplotype 4 defined by variant 545C>T was associated with estrogen receptor and progesterone receptor negative status (P = 0.0053 and P = 0.0199, respectively). CONCLUSION: Our findings imply that SIPA1 germline polymorphisms are associated with aggressive disease behavior in the cohort examined. If these results hold true in other populations, then knowledge of SIPA1 SNP genotypes could potentially enhance current staging protocols

    Potential Genetic Risk Factors for Chronic TMD: Genetic Associations from the OPPERA Case Control Study

    Get PDF
    Genetic factors play a role in the etiology of persistent pain conditions, putatively by modulating underlying processes such as nociceptive sensitivity, psychological well-being, inflammation, and autonomic response. However, to date, only a few genes have been associated with temporomandibular disorders (TMD). This study evaluated 358 genes involved in pain processes, comparing allelic frequencies between 166 cases with chronic TMD and 1442 controls enrolled in the OPPERA (Orofacial Pain: Prospective Evaluation and Risk Assessment) study cooperative agreement. To enhance statistical power, 182 TMD cases and 170 controls from a similar study were included in the analysis. Genotyping was performed using the Pain Research Panel, an Affymetrix gene chip representing 3295 single nucleotide polymorphisms, including ancestry-informative markers that were used to adjust for population stratification. Adjusted associations between genetic markers and TMD case status were evaluated using logistic regression. The OPPERA findings provided evidence supporting previously-reported associations between TMD and two genes: HTR2A and COMT. Other genes were revealed as potential new genetic risk factors for TMD, including NR3C1, CAMK4, CHRM2, IFRD1, and GRK5. While these findings need to be replicated in independent cohorts, the genes potentially represent important markers of risk for TMD and they identify potential targets for therapeutic intervention
    corecore